A comparison of incomplete-data methods for categorical data
نویسندگان
چکیده
منابع مشابه
Handling Incomplete Categorical Data for Supervised Learning
Classification is an important research topic in knowledge discovery. Most of the researches on classification concern that a complete dataset is given as a training dataset and the test data contain all values of attributes without missing. Unfortunately, incomplete data usually exist in real-world applications. In this paper, we propose new handling schemes of learning classification models f...
متن کاملA Fuzzy C-means Algorithm for Clustering Fuzzy Data and Its Application in Clustering Incomplete Data
The fuzzy c-means clustering algorithm is a useful tool for clustering; but it is convenient only for crisp complete data. In this article, an enhancement of the algorithm is proposed which is suitable for clustering trapezoidal fuzzy data. A linear ranking function is used to define a distance for trapezoidal fuzzy data. Then, as an application, a method based on the proposed algorithm is pres...
متن کاملA Comparison of Categorical Attribute Data Clustering Methods
Clustering data in Euclidean space has a long tradition and there has been considerable attention on analyzing several different cost functions. Unfortunately these result rarely generalize to clustering of categorical attribute data. Instead, a simple heuristic k-modes is the most commonly used method despite its modest performance. In this study, we model clusters by their empirical distribut...
متن کاملGraphical Methods for Categorical Data
The graphical displays shown here are implemented in SAS/IML software whose combination of matrix operations, built-in Statistical methods for categorical data, such as loglinear models functions for contingency table analysis, and graphics provide a and logistic regression, represent discrete analogs of the analysis of convenient environment for graphical display for multiway variance and regr...
متن کاملGraphical Methods for Categorical Data
Statistical methods for categorical data, such as loglinear models and logistic regression, represent discrete analogs of the analysis of variance and regression methods for continuous response variables. However. while graphical display techniques are common adjuncts to analysis of variance and regression, methods for plotting contingency table data are not as widely used. This paper provides ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Statistical Methods in Medical Research
سال: 2012
ISSN: 0962-2802,1477-0334
DOI: 10.1177/0962280212465502